USHIFT-7223: Automate Polarion tests OCP-66820 and OCP-66882 #6905
|
@agullon: This pull request references USHIFT-7223, which is a valid Jira issue. Warning: the referenced Jira issue has an invalid target version for the target branch this PR targets: expected the task to target the "5.0.0" version, but no target version was set. |
|
/retest |
ARM (Graviton) CI instances take significantly longer to complete reboot cycles than x86 instances. SSH typically becomes available in ~60-120s on ARM vs ~30-50s on x86. When tests involve destructive operations like removing /var/lib/microshift followed by a fresh bootstrap (certificate generation, etcd initialization), the total reboot cycle on ARM can reach ~400s, exceeding the previous 5m (300s) timeout.

This was observed in PR openshift#6905, where the new prerun-data-management tests triggered reboot timeouts exclusively on ARM jobs, while identical x86 runs passed comfortably.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
pre-commit.check-secrets: ENABLED
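The timeout arithmetic in this commit message can be illustrated with a small polling sketch. It is only an illustration under stated assumptions: `wait_until_ready`, `FakeHost`, and `reboot_completes` are hypothetical names, the 10s poll interval and the 600s alternative timeout are made-up values, and the simulated host stands in for a real SSH probe. It is not the code used by the test suite.

```python
from typing import Callable


def wait_until_ready(
    is_ready: Callable[[], bool],
    timeout_s: float,
    clock: Callable[[], float],
    sleep: Callable[[float], None],
    poll_interval_s: float = 10.0,  # hypothetical poll interval
) -> bool:
    """Poll is_ready until it returns True or timeout_s has elapsed."""
    deadline = clock() + timeout_s
    while True:
        if is_ready():
            return True
        if clock() >= deadline:
            return False
        sleep(poll_interval_s)


class FakeHost:
    """Simulated host that becomes reachable ready_after_s seconds after reboot."""

    def __init__(self, ready_after_s: float) -> None:
        self.now = 0.0
        self.ready_after_s = ready_after_s

    def clock(self) -> float:
        return self.now

    def sleep(self, seconds: float) -> None:
        # Advance simulated time instead of really sleeping.
        self.now += seconds

    def is_ready(self) -> bool:
        return self.now >= self.ready_after_s


def reboot_completes(ready_after_s: float, timeout_s: float) -> bool:
    """Return True if a host that needs ready_after_s seconds beats the timeout."""
    host = FakeHost(ready_after_s)
    return wait_until_ready(host.is_ready, timeout_s, host.clock, host.sleep)
```

With these definitions, `reboot_completes(400, 300)` models the ~400s ARM reboot cycle against the previous 300s timeout and fails, while a hypothetical larger timeout such as `reboot_completes(400, 600)` succeeds.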
|
/test e2e-aws-tests-release |
|
/label tide/merge-method-squash |
|
/test e2e-aws-tests-release |
Add Robot Framework tests for two prerun data management scenarios ported from openshift-tests-private:
- OCP-66820: verify that a version file 3 minors behind the running executable causes MicroShift to fail on reboot, with the failure reason logged to prerun_failed.log and reported by greenboot.
- OCP-66882: verify that removing /var/lib/microshift while health.json shows healthy causes MicroShift to start fresh, ignoring stale health info.

Both tests use a Reboot MicroShift Host And Wait For Greenboot keyword that tolerates greenboot rollback loops: it waits up to 10 minutes for greenboot to settle and verifies system stability by comparing boot IDs before and after a delay to detect pending rollback reboots.

Test randomization is disabled in both backup scenarios because prerun-data-management.robot is destructive (deletes MicroShift data, triggers greenboot rollback loops) and must run last.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
pre-commit.check-secrets: ENABLED
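The version condition exercised by OCP-66820 can be pictured as a minor-version skew check. The following is a minimal sketch, not MicroShift's actual prerun logic: the function names, the `major.minor.patch` string format, the treatment of a major-version mismatch, and the `max_minor_skew=2` default are all assumptions, chosen only so that data 3 minors behind the executable is rejected as the test expects.

```python
def parse_major_minor(version: str) -> tuple[int, int]:
    """Extract (major, minor) from a 'major.minor.patch' style string."""
    major, minor, *_ = version.split(".")
    return int(major), int(minor)


def data_too_old(data_version: str, exe_version: str, max_minor_skew: int = 2) -> bool:
    """Return True if the data version is too far behind the executable.

    max_minor_skew is an assumed threshold, not a value taken from MicroShift.
    """
    data_major, data_minor = parse_major_minor(data_version)
    exe_major, exe_minor = parse_major_minor(exe_version)
    if data_major != exe_major:
        # Assumption: a different major version is never compatible.
        return True
    return exe_minor - data_minor > max_minor_skew
```

For example, with hypothetical versions, `data_too_old("4.17.0", "4.20.0")` is `True` because the data is 3 minors behind, while `data_too_old("4.18.0", "4.20.0")` is `False` under the assumed threshold.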
|
/test e2e-aws-tests-release |
Run "Data Missing" before "Version Too Old" so the healthy reboot hits a fresh system. "Version Too Old" triggers greenboot rollback loops that degrade ARM I/O performance, causing subsequent healthy reboots to exceed the 5-minute timeout.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
pre-commit.check-secrets: ENABLED
|
/test e2e-aws-tests-release |
|
/override ci/prow/e2e-aws-tests-arm |
|
@agullon: Overrode contexts on behalf of agullon: ci/prow/e2e-aws-tests-arm, ci/prow/e2e-aws-tests-bootc-release-el10 |
|
@agullon: all tests passed! |
|
/verified by CI |
|
@agullon: This PR has been marked as verified by CI. |
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: agullon, kasturinarra |
Summary
Automate two Polarion test cases ported from openshift-tests-private:
- OCP-66820: a version file 3 minors behind the running executable causes MicroShift to fail on reboot, with the failure reason logged to prerun_failed.log and reported by greenboot
- OCP-66882: removing /var/lib/microshift with healthy health.json causes MicroShift to start fresh

Tests are added to el98-src@backup-and-restore-on-reboot (presubmits) and el98-lrel@backups (releases).

Greenboot handling
Both tests can trigger greenboot rollback loops (intentional prerun failure, slow fresh bootstrap on ARM). A Reboot MicroShift Host And Wait For Greenboot keyword waits up to 10m for greenboot to settle and verifies system stability by comparing boot IDs before and after a delay to detect pending rollback reboots.

Test randomization is disabled in both scenarios because the prerun tests are destructive (data deletion, greenboot rollback loops) and must run last to avoid destabilizing subsequent test suites.
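The boot ID comparison described under greenboot handling can be sketched in Python as follows. This is an illustration only, not the Robot Framework keyword from this PR: `system_is_stable`, `read_local_boot_id`, and the 60s settle delay are hypothetical, and the sketch reads the local kernel boot ID from `/proc/sys/kernel/random/boot_id`, whereas the real tests would query the remote MicroShift host.

```python
import time
from typing import Callable

# Linux exposes a per-boot random UUID here; it changes on every reboot.
BOOT_ID_PATH = "/proc/sys/kernel/random/boot_id"


def read_local_boot_id() -> str:
    """Read the boot ID of the local machine (Linux only)."""
    with open(BOOT_ID_PATH, encoding="utf-8") as f:
        return f.read().strip()


def system_is_stable(
    read_boot_id: Callable[[], str] = read_local_boot_id,
    settle_delay_s: float = 60.0,  # hypothetical delay, not taken from the PR
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Return True if the boot ID is unchanged after settle_delay_s seconds.

    A changed boot ID means the host rebooted during the delay, for example
    because a rollback reboot was still pending.
    """
    before = read_boot_id()
    sleep(settle_delay_s)
    after = read_boot_id()
    return before == after
```

The reader and sleep functions are injectable so the check can be exercised without waiting or rebooting, for instance by passing a function that returns two different IDs in sequence to simulate a rollback reboot.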
Test plan
- el98-src@backup-and-restore-on-reboot passes on x86 and ARM
- el98-lrel@backups passes on x86 and ARM